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ABSTRACT 



Aims. Caustic-crossing binary-lens microlensing events are important anomalous events because they are capable of detecting an extrasolar 
planet companion orbiting the lens star. Fast and robust modelling methods are thus of prime interest in helping to decide whether a planet is 
detected by an event. Cassan introduced a new set of parameters to model binary-lens events, which are closely related to properties of the 
light curve. In this work, we explain how Bayesian priors can be added to this framework, and investigate on interesting options. 
Methods. We develop a mathematical formulation that allows us to compute analytically the priors on the new parameters, given some 
previous knowledge about other physical quantities. We explicitly compute the priors for a number of interesting cases, and show how this can 
be implemented in a fully Bayesian, Markov chain Monte Carlo algorithm. 

Results. Using Bayesian priors can accelerate microlens fitting codes by reducing the time spent considering physically implausible models, 
and helps us to discriminate between alternative models based on the physical plausibility of their parameters. 
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1. Introduction 

Mao & Paczynski (1991) first suggested that observations of 
Galactic gravitational microlensing events could lead to the dis- 
covery of extrasolar planets. Microlensing involves the time- 
dependent brightening and then dimming of a background 
source star as an intervening massive object (the lens) crosses 
the observer line-of-sight. Light rays from the source bend in 
the vicinity of the lens, focusing them toward the observer. 
Since 1994, survey teams such as OGLE 1 (OGLE III, Udalski 
2003) and MOA 2 (Bond et al. 2001) have reported more than 
four thousand microlensing events toward the Galactic bulge 
to date. Several hundreds of these events have been carefully 
selected and densely sampled by follow-up networks such as 
PLANET 3 , //FUN 4 , RoboNet 5 , and MiNDSTEp 6 . Although 
microlensing teams have so far published only nine exoplanet 
detections, the method itself stands out because of its high sen- 
sitivity to low-mass planets with orbits of several astronomical 
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units. It thus probes in the planet mass-separation plane a re- 
gion beyond reach of any other technique, as demonstrated by 
the detection of the very first cool super-Earth, OGLE 2005- 
BLG-390Lb (Beaulieu et al. 2006; Kubas et al. 2008). 

A number of microlensing events exhibit anomalous be- 
haviour (i.e., they cannot be adequately modelled by the stan- 
dard single-lens light curve, e.g., Paczynski 1986) and some 
of these anomalies can be attributed to lensing by binary ob- 
jects. The types of light curves produced by binary lensing form 
a rich tapestry but, in general, binary systems with two equal 
mass components tend to exhibit pronounced, long anomalies 
in their light curves, whereas when the secondary companion 
is only a small fraction of the total mass, the anomalies can 
be quite short and subtle. It is primarily these latter types of 
anomalies that may be caused by star-planet binaries (Mao & 
Paczynski 1991; Gould & Loeb 1992). Nevertheless, because 
the true nature of the anomaly cannot always be established 
while the microlensing event is still ongoing, every binary-lens 
microlensing event constitutes a prime target for planet hunt- 
ing. 

In binary lensing, the lens system configuration delineates 
regions of space on the source plane that are bound by gravi- 
tational caustics. Caustics are closed curves with concave seg- 
ments that meet in outward pointing cusps, defined by the loca- 
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tion where the Jacobian determinant of the lens mapping equa- 
tion vanishes, i.e., are lines of infinite point-source magnifica- 
tion. There are three kinds of caustic topologies, which depend 
on the values of the binary lens mass ratio q and the two com- 
ponent projected separation d in angular Einstein ring radius #e 
(Einstein 1936) 



0e = 



4GM (D S -D L 



D S D L 



(1) 



where £>s, Dl are the observer- source and observer-lens dis- 
tances and M the lens total mass. In the close separation regime 
(cf. Fig. 1 of Cassan 2008), there are three caustics, one central 
(4-cusp) and two (3 -cusp) planetary caustics. In the intermedi- 
ate regime, there is only one (6-cusp) caustic, and in the wide 
separation regime, there is one central and one planetary caus- 
tic (both with 4 cusps). 

In many cases, the source trajectory happens to cross a 
caustic. As the source crosses the caustic curve and enters the 
enclosed area, a new pair of images appears, causing a sud- 
den increase in the observed brightness. In a similar way, when 
the source exits the area defined by the caustics, the two im- 
ages merge and disappear, causing a rapid drop in the observed 
brightness. These dramatic changes in magnification result in 
readily recognisable jumps in microlensing light curves. As 
emphasised by Cassan (2008), the ingress and egress times t m 
and t out may be restricted to within very tight intervals when 
caustic crossing features have been identified in the light curve, 
and thus advantageously used as alternative modelling param- 
eters. 

The new set of binary-lens modelling parameters intro- 
duced by Cassan (2008) have the advantage that two of these 
parameters are very closely related to features that can be di- 
rectly identified in the light curve. Using this new formula- 
tion to analyse the data of OGLE 2007-BLG-472 in its most 
straightforward implementation as a maximum likelihood anal- 
ysis ("minimising ;^ 2 "), Kains et al. (2009) unveiled a subtle 
aspect of binary-lens modelling: relatively improbable physi- 
cal models with very large values of were found with^ 2 val- 
ues lower than other more plausible models. To avoid finding 
parameter combinations that are physically unlikely, dramatic 
progress can be achieved by switching to a Bayesian analysis. 
This is desirable as the Bayesian approach makes use of prior 
information on the underlying physical parameters, while x 2 
says nothing about parameter plausibility. 

In this article, we show how to derive Bayesian priors for 
the caustic-crossing binary-lens parameters defined by Cassan 
(2008). These are based on physical priors on quantities that 
can be estimated from Galactic models or calculated from al- 
ready observed events (sec. 2 and 3). In sec. 4, we describe an 
implementation of this Bayesian formalism within a Markov 
chain Monte Carlo fitting scheme, using in particular priors on 
the Einstein time (time for the source to travel an angular 
distance #e)- 

2. Maximum likelihood versus Bayesian fitting 

Cassan (2008) introduced a new parameterisation of the binary 
lens microlens light curve model that is well suited to describ- 



ing caustic-crossing events. In this formalism, the caustic curve 
in the source plane is parameterised by a curvilinear abscissa 
(or arc length) from to 2. The trajectory of a source cross- 
ing a caustic, which is classically parameterised by its impact 
parameter uo and position angle a, can alternatively be defined 
by giving the values S{ n at ingress and s out at egress 7 . The two 
parameters timing the trajectory, (time to cross one Einstein 
radius) and to (date at minimum impact parameter uq), are then 
replaced by the ingress and egress times t m and t out . The caustic 
curve is specified in the source (i.e., caustic) plane by a com- 
plex function f (s) = £(s) + irj(s) (see sec. 3.2), and once S{ n and 
s out are specified, the source trajectory is fully defined. This bi- 
jective switch of parameters, (wo><Me>*o) (sin,Sout^in, W)> 
takes advantage of the relatively high precision with which t m 
and t ou t can be inferred from the observations (Kains et al. 
2009; Kubas et al. 2005). 

Using these new parameters, Kains et al. (2009) anal- 
ysed the caustic crossing event OGLE 2007-BLG-472. The ap- 
proach taken was a maximum likelihood procedure, quantify- 
ing the "goodness-of-fit" by a x 2 statistic, and minimising the 
X 2 to optimise the fit. A grid search in (d, q) with even spacing 
in log d and \ogq was conducted. For each (d,q) caustic con- 
figuration, a genetic algorithm was used to explore widely the 
remaining parameter space. While (s , i n ,s , ut) covered the full 
range of possibilities, [0,2] x [0,2], t m and t out evolved in very 
tight intervals based on the values inferred from the light curve 
features (caustic crossing magnification peaks). These first fits 
were refined using a Markov chain Monte Carlo (MCMC) algo- 
rithm, again holding (d, q) fixed while optimising the remaining 
parameters. The best-fit models in each of the identified best-fit 
regions were then found by allowing all parameters to vary. 

As expected for binary lens events, the resulting x 2 ( d, q) 
maps uncovered a variety of widely-separated model parameter 
regions where a relatively low x 2 could be achieved. The low- 
est x 2 models corresponded to very low q, in the planet-mass 
regime. But with a short duration between the caustic entry and 
exit, and a planetary caustic size scaling as g 1/2 , these models 
implied an extremely long Einstein time ~ (W _ ^in)/V /2 ~ 
10 4 days, which is very unlikely according to kinematics of 
stars motions within the Milky Way. These best-fit maximum 
likelihood models were therefore rejected on this physical ar- 
gument. This need to reject the lowest^ 2 models highlights a 
weakness in the maximum likelihood approach, which neglects 
prior distributions on the parameter space. On the other hand, 
Bayesian parameter estimation takes proper account of prior 
distributions in the parameter space (see e.g., Trotta 2008, for 
a review of astrophysical applications). 

In a Bayesian analysis, the posterior probability distribution 
over the model parameters is a function of the data D 



P(Q\D) = 



P(D\0)P(0) 
J P(D\0)P(0)d0' 



(2) 



where P{6) is the prior probability distribution on the parame- 
ters, and the denominator partition function ensures proper nor- 
malisation of the posterior as a probability distribution over the 



7 We use the notations "in" and "out" in place of "entry" and "exit" 
of Cassan (2008) to write more condensed formulae. 
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parameters 6. The likelihood P(D\0) is a function of the pa- 
rameters 6 and a probability distribution over the data D. For 
Gaussian measurement errors with N data points having indi- 
vidual standard deviations <tu the likelihood is 



L(0) ee P(D\0) 



exp 



(-k 2 l 



(2»)"' 2 nf,^r 

Since maximising the likelihood corresponds to minimising 



(3) 



(4) 



- 2 In L(6) = x 2 + 2 ^ In a + — In In , 

a maximum likelihood is equivalent to a minimum in x 2 when 
the error bars cti are known, and is essentially a Bayesian analy- 
sis that implicitly assumes a prior that is uniform on the chosen 
parameters intervals. As we show in the next section, assuming 
more realistic priors could substantially affect the fitting pro- 
cess. 



3. A Bayesian prior for (s h 



t) 



3. 1. Distribution of 0i n , s out ) for isotropic trajectories 

A uniform prior probability distribution in the parameter square 
(sin,Sout) is implicit in the maximum likelihood analysis. 
Because of the non-linear correspondence between the two sets 
of parameters, it should correspond to a rather unlikely prior 
for the (uo,a) source trajectory parameters. A more plausible 
prior would for example arrange for the source trajectories to 
be uniformly distributed and isotropic in orientation. 

In Fig. 1, the top panel shows an intermediate caustic with 
d = 1.1 and q - 0,1 (i.e., six cusps, in orange) with several 
crossing trajectories. It can be seen that a straight line may 
cross the caustic at two (black line), four (red line), or six (blue 
line) locations, depending on the number and orientation of the 
cusps. In the bottom panel, ~ 10 4 of these trajectories were ran- 
domly shot and their corresponding position in the (s{ n , s out ) 
square reported, using the same colour convention. Trajectories 
with a single pair of ingress and egress map into unique black 
points, while for red and blue trajectory lines, there are respec- 
tively two and three possible pairs of ingress and egress points. 

We can understand some of the structures in the bottom 
panel of Fig. 1 as follows: vertical and horizontal lines marking 
the s values at the cusps divide the (s{ n , s oui ) square into boxes. 
No trajectories appear in the boxes along the diagonal because 
the caustics curve concavely outward. It is thus impossible for 
a line that enters at some position between two cusps to exit 
at any point between those two cusps. In a similar way, other 
empty regions correspond to ingress/egress pairs that cannot 
be realised by straight lines crossing the caustic. The trajecto- 
ries are seen to bunch up around ingress/egress pairs occurring 
close to a cusp. This happens because any trajectory entering 
close to a cusp is very likely (for a wide range of angles) to also 
exit near the same cusp. 



3.2. Analytical formulation 

We develop a mathematical formulation that allows us to com- 
pute analytically priors on (Sm,Sout)- The lens equation for a 
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Fig. 1. The top panel illustrates the three kinds of possible 
source trajectories crossing a caustic: the black line has a sin- 
gle pair of ingress and egress points, while the red and blue 
lines have, respectively, two and three ingress/egress points. 
The bottom panel shows in the (s{ n , s^ut) square the locations of 
a random distribution of ~ 10 4 of these trajectories crossing a 
(d = 1 . 1 , q = 0, l)-caustic, with the same colour convention. The 
solid vertical and horizontal black line mark the ^-locations of 
the caustic cusps. 



binary lens with separation d and mass ratio q defines the map- 
ping of the position of a point- source ( on the source plane to 
the positions of its three or five images at z on the lens plane 



1 



1 +q 



z z + d 



(5) 



where the more massive body is at the centre and the compan- 
ion on the left-hand side. Following Witt (1990), the caustic 
lines f are parametrised by a parameter (p e [0, 2n] 



1 



l+q 



1 



(z + d) 2 



(6) 



where z and f satisfy Eq. (5). For a given angle 0, it is possi- 
ble to solve a fourth order polynomial equation in z to obtain 
the corresponding caustic points £. While the parameter cp is 
used here to write the useful formulae, in practice we use in- 
stead the equivalent parameter s = s((p) (bijection) introduced 
by Cassan (2008), which has the advantage of sampling the 
caustics evenly. 
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To write more condensed formulae, we use notations that 
resemble two-dimensional vector operations. Given two com- 
plex numbers fi = f i + ir]\ and £2 = £2 + ^2, we write fi A f2 = 
^1^72 - ^71^2 ("wedge product") and f 1 • £2 = £1^2 + mm ("scalar 
product"), which are both real numbers. Moreover, a quantity 
related to a caustic entry (exit) is indicated by a subscript "in" 
("out"). Using the usual convention that uo > when the ori- 
gin of the coordinate system stays on the right-hand side of the 
source trajectory, one can write 



U{) = 



fout A f j n 
I f out _ fin I 



/ ?7out _ ?7in 

a = arctan 

" "fin 



tE = 



\ £out ' 



+ 7lH (fin - fout) » 



to = 



If out fin 
^out + ^in 



' (^out tin) 



1 fout + f in , . £ out A fi r 
+ Z- 



2 fout fin Ifout-finl' 



(7) 
(8) 
(9) 
(10) 



where H is the Heaviside step function, and a = 
I sign(7/ out - r]i n ) if f out = f in . The transformation between the 
two sets of parameters is given by the Jacobian 



d(uo,a,t E ,to) 



d(s 



in? ^oub hn? L out 



^in? ^out) 



(ID 



Since the dependencies of the classical parameters with respect 
to the new ones are uo(s in , s out ), a(s in , s out ), t E (s in? ^out? ^in? ^outX 
and to(s{ n , ^out? ^in? W)» J reads (using cp instead of s) 



dup dup 



'(Pout 
da 



dtE_ 
¥in 

dto 



'(Pout 
dt E 








d(pout dt- m dtout 
dtp dtp dtp 
dtm dt out 



d(uo,a) 



^(0in,0out) 



d(t E Jo) 



dftn^out) 



(12) 



After some algebra, we find for the components of the two lat- 
ter Jacobians 



duo 



duo 
d^out 
da 



da 
d^out 
dtE 
dt m 
dtE 
dt ou t 



duo d^m duo drju 

dfhi #in ^^7in #ii 
(fout — fin) ' fout 



If out _ fin I 
(fout — fin) ' fin 
Ifout-finP 



(r y \ a d£m 
(fout "fin) A—— 

#in 



(fout -fin) A 



Jfo 



#0 



(13) 
(14) 



da drji r 



da d^i r 
<9f in J0 ir 
(fout "fin) A ^ 

Ifout-finl 2 
(fout - fin) A ^gr~ 



Ifout " 
1 



fir 



Ifout-finl 
1 



I fout - fir 



(15) 

(16) 
(17) 
(18) 



= 2 + 



&o_ = 1 

dfout 2 
so that 

3 (wo, or) 



1 fout + fin . fout A fi r 
+ 1- 



2 fout fin Ifout-finl 

1 fout + fin . fout A fin 
+ 1- 



d(0in,0out) 
d(t E Jo) 



2 fout fin |f out - fir 



|(fout " fin) A ^ I I (^ut " fin) A ^ZSL I 



^fen^out) 

which gives 



I fout fin 
1 

I fout — fin I I fout — fir 



dtp dtp 



dt ir 



dtox 



J = 



I (fout - fin) A ^ 1 1 (fout - fin) A | 
Ifout-finl 4 



(19) 
(20) 

(21) 
(22) 

(23) 



The derivatives d£/d(p evaluated at the caustic entry and exit 
are given by (Cassan 2008) 



d£_ = dz + ^ dz 
d(p dcp d(p ' 

where 

dz _ i (z + d) 2 + qz 2 
# ~ 2 ( z + d) 3 +qz 3 



(z + d)z. 



(24) 

(25) 
►0, / 



In the limit of cusp-crossing trajectories, i.e., f ou t - fi 
behaves like l/|f out -f in | 2 . 

As expected, the Jacobian / is a function of the two parame- 
ters (s in , s oui ), while the bijection between the two set of param- 
eters was possible by involving (t m J oni ). However, / is not yet 
the Bayesian prior P{s- m , s oni ) we seek. We have yet to consider 
two aspects. Firstly, the parameters (wo><Me>*o) are themselves 
affected by prior probability distributions; this is discussed at 
the end of this section and is the topic of sec. 4.1. Secondly, 
caustic crossing points are either entries or exits since the tra- 
jectory is orientated from t m to t out (t out >t{ n ), which is not ac- 
counted for in Eq. (23). To solve this second issue, we calculate 
the outward normal vector to the caustics at point f , 



N c = i d i 



(26) 



as well as the normalised and orientated trajectory vector 



fout fin 



(27) 



Ifout-finl 

and check whether N c ,m • N tt m < (inward motion at f in ) and 
Nc,out • N tt out > (outward motion at f ou t)- If these conditions 
are fulfilled, we write P(s[ n , s out ) = /, and otherwise. 

Defined in this way, P(s[ n , s out ) is thus the prior on (s[ n , s out ) 
that we seek, in the special case of isotropic source trajecto- 
ries (uniform distributions for uo and a), uniform microlens- 
ing events rate (^o is a random number), and uniform Einstein 
time tE > 0. In Fig. 2, we have plotted ^(Sm^out) for various 
(J, q) configurations as a function of (horizontal axis) and 
^out (vertical axis), higher values of P appearing in white (lin- 
ear scale). From left to right, these configurations are: (a) in- 
termediate with d = 1.1 and q = 0.1; (b) wide+central and (c) 
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Fig. 2. Bayesian prior P(^in,^out) as a function of S{ n (horizontal axis) and s out (vertical axis), where we have assumed isotropic 
source trajectories and uniform distributions for and event rate. Higher values of P appear in white (linear scale). From left to 
right, the caustic configurations are: (a) intermediate with d=l.l and q = 0.1; (b) wide+central and (c) wide+ secondary caustic, 
both for d = 2 and q = 0.1; (d) close+central and (e) close+secondary caustic, both for d = 0.5 and q = 0.1. 



wide+secondary, both configurations for d = 2 and q = 0.1; (d) 
close+central and (e) close+secondary caustic, both for d = 0.5 
and q = 0.l. One can compare the intermediate case plot with 
Fig. 1. In sec. 4.1, we investigate how assuming different priors 
on the Einstein time affect the prior on (s[ n , s out ). 

3.3. Extended sources 

When the source approaches the caustic curves (at typically 
less than three projected source radii), one needs to take into 
account extended source effects in the modelling. As for t m and 
t out , it is usually possible to extract from the light curve a new 
parameter that can be used instead of the source radius. 

It is well known that when the source crosses a straight 
line caustic (which is in many cases a good approximation of 
a real caustic), one can easily infer the duration of the cross- 
ing from the shape of the caustic crossing feature itself (Cassan 
et al. 2004; Albrow et al. 1999; Schneider & Wagoner 1987). 
Here, we define this duration as the time for the source to cross 
the caustic line by its full radius {i.e., from centre to limb), 
so that At cc = p*/v±. I n this definition, p* is the source ra- 
dius in Einstein ring radius units, v± is the component of the 
source velocity perpendicular to the caustic, and the subscript 
"cc" refers to either the caustic entry ("in") or exit ("out"). For 
a given absolute velocity 1/fe, the source will take longer to 
cross the caustic if the trajectory makes a tangential angle with 
it. More precisely, the normal velocity is proportional to the co- 
sine of the angle between the trajectory and the caustic normal 
v ± = |Af c?cc • A^ cc |/fe. Inserting into this equation the expressions 
for tE, Af c?cc , and N ttCC (Eqs. 9, 26, and 27, respectively), we can 
compute the source radius p* as a function of At cc 



(fout-fm)A^ 

P* = ■ du ■ At cc . 

(^out — ^in) 7bjT~ 



(28) 



This expression would be exact if the crossed caustic were 
a perfect and infinite straight line. In reality, however, caus- 
tic curves always have a curvature, and sometimes the source 
partly crosses a cusp. Nevertheless, there is no arguing that At cc 
is more suitable than p* for parameterising the observed caus- 
tic crossing, since its rough value can be estimated from the 
light curve, in contrast to p* . In practice, we choose the caus- 



tic crossing that provides the most comprehensive data cover- 
age and which ressembles most closely a straight line caustic 
crossing to extract the starting value when fitting At cc . 

4. Markov Chain Monte Carlo fitting 

4. 1. Examples of prior probability distributions 

For a given set of fitting parameters (si n ,s out ,ti n ,t out , Af cc ), the 
prior of the probed model is given by 



P(model) = P(sin, s out ) P(t in , t ouU At cc ) . 



(29) 



The prior ^(sin^out) is computed as explained in sec. 3.2, and 
may include priors that have been defined using the other pa- 
rameters uo, a, tE, to, or p* by properly weighting P^in,^). 
Given Eqs. (9) and (10), a prior P^in^out) is equivalent to 
a prior POoJe) with a corresponding change in the prior 
P($m, ^out)- We now discuss different priors for the various pa- 
rameters that could realistically be used in the Bayesian anal- 
ysis. In Fig. 2 for example, we illustrate the case of isotropic 
trajectories, which corresponds to uniform priors for the pa- 
rameters uo and a. This choice is justifiable, since the direction 
of the binary lens axis is random. 

The first class of priors that we can use are uninformative 
priors. Since the prior expresses information about the values 
of parameters before any data has been taken, we know that pa- 
rameters such as to,tE, or At cc must have uninformative priors, 
because we can only estimate their values by examining the 
light curve. Although it is natural to use uniform priors for to, 
a or uo, for strictly positive parameters such as At cc or tE, it is 
more suitable and commonly decided to use an uninformative 
prior that is uniform in the logarithm of the parameter. 

We illustrate the use of an uninformative prior (uniform pri- 
ors in logfe, in uo, a, and ^o) by computing ^(sin^out) for the 
solution configuration of the binary lens event OGLE 2002- 
BLG-069 (Kubas et al. 2005; Cassan et al. 2004). The config- 
uration for that event was that of a source crossing the central 
caustic of a close binary lens with parameters d = 0.46, q = 0.58 
and ^ut _ ^in - 14.5 days. The resulting prior P(s{ n , s oui ) is plot- 
ted in Fig. 3, where the red cross shows the location of the caus- 
tic crossings at S[ n - 1.3, s out 0.3. This falls within a region 
of high probability, meaning that the corresponding P(s[ n , s out ) 
prior would have been a reasonable choice for this event. 
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Fig. 3. Prior P(^ n ,^out) for the solution configuration of caus- 
tic crossing event OGLE 2002-BLG-069 (close+central caus- 
tic), with an underlying (uninformative) uniform prior in logfe- 
The red cross shows the location of the found caustic crossing, 
(1.3,0.3), which falls in a region of relative high probability. 



The second class of priors are those that we can derive 
using information known before the event is observed. In mi- 
crolensing, a convenient parameter on which such a prior can 
be placed is the Einstein time fe- This parameter depends on 
the relative distances between the source, the lens, and the ob- 
server, the kinematics of both the lens and the source and the 
lens' mass function. Combining all these data can help us to 
determine which ranges of values of are more likely to be 
observed. For the event OGLE-2007-BLG-472 (Kains et al. 
2009), no prior information was included on (or the prior was 
assumed to be uninformative), which cause the best-fit models 
to have unrealistically long fe- 

The method presented here can indeed be extended to in- 
clude informative priors on parameters other than tE, such as 
the source flux distribution, the blending light due to the lens, 
the relative proper motion of the source and lens, or the source- 
radius caustic crossing-time, but this would require us to link 
the analysis to a Monte Carlo model of the Galaxy. Although 
our approach can be generalised to these possible extensions, 
they are beyond the scope of the present paper. Using a l s ° 
has the advantage that its statistical distribution is fairly well- 
constrained by observed single-lens light curves, since this pa- 
rameter is common to single- and binary-lens events. 

Empirical distributions of can be obtained by modelling a 
large number of observed microlensing events. The top panel of 
Fig. 4 shows a histogram (blue rectangles) of values found by 
fitting 788 single-lens microlensing events from the 2006-2007 
OGLE seasons (including blending). As expected, the distribu- 
tion is far from uniform but instead appears roughly log-normal 
with a peak close to logfe - 1-32 and o-\ ogtE ^ 0.4. Theoretical 
distributions of can also be based on predictions obtained 
with a Galactic model, such as the distribution advocated by 
Wood & Mao (2005). This is plotted as a solid black line on 
top of our histogram (Fig. 4, top panel) and is seen to closely 
match the empirical distribution. Nevertheless, the distribution 
of Wood & Mao (2005) lacks both extremely long events (say 
tE > 300 days) that can be interpreted as black hole lenses, and 
extremely short events (say < 3 days) that can be interpreted 



Histogram of t E 




Fig. 4. In the top panel, the histogram (blue rectangles) shows 
the distribution of found after fitting 788 single-lens mi- 
crolensing events from OGLE 2006-2007 seasons. The solid 
black line shows the model prediction of Wood & Mao (2005), 
which is in good agreement with the data. The bottom panel 
displays the prior P(s{ n , s out ) for the same intermediate caustic 
configuration as for Fig. 2 (d = 1.1, q = 0.1), but assuming an 
underlying prior for tE given by the above distribution. 



as evidence of a population of free floating planets. But selec- 
tion effects cause these extreme events to be under-represented 
in the observed tE distribution, as can be seen in Fig. 4. For 
these exceptional cases, special treatment would be required, 
for example using a prior on tE that is more generous to ex- 
treme values in an attempt to compensate for selection effects. 
For most of binary lens events, however, a mild discrimination 
against black hole or loose planet lenses seems appropriate. 

Using the Wood & Mao (2005) distribution as a prior, 
we compute and plot (Fig. 4, bottom panel) the correspond- 
ing distribution P(S[ n , s out ) by assuming (t out - P m ) = 20 days, 
d- 1.1, and q = 0.1 (the same intermediate configuration as 
Fig. 2). Figure 4 (bottom panel) shows that with this prior, 
cusp-crossing trajectories are far less likely to happen. For a 
trajectory near the cusps, this is because the source has only 
a short distance to travel between the entry and exit, while 
(W _ tin) is constant, meaning that the source's motion has to 
be very slow, leading to large values of tE, which are now ruled 
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out by the prior 8 . This effect can be seen directly in the plot 
of P(s[ n , s out ), where strong "wing" features at the cusps disap- 
pear, and other features appear (compare with Fig. 2). 

4.2. Posterior probability distributions: MCMC fitting 

In practice, these and other statistics related to the poste- 
rior parameter distribution can be evaluated efficiently us- 
ing a Markov chain Monte Carlo to evaluate the probability- 
weighted integrals in B ayes' theorem. A random walk in the 
parameter space is undertaken by taking random steps drawn 
from a distribution of the parameters 6. Each proposed step is 
accepted or rejected based on the probability of the new point 
relative to the old one exceeding some threshold, which is ad- 
justed to maintain the acceptance rate above roughly 20-30%. 
The resulting chain locates and wanders around a local mini- 
mum, sampling the parameters with a weight proportional to 
the posterior probability. 

For a maximum likelihood analysis, the relative probability 
used to accept or reject new steps is exp |-A^ 2 /2j alone, where 
Ax 2 is the x 2 difference between the new and old points; in a 
full Bayesian analysis, we multiply this exponential factor by 
the ratio of new to old values of the prior P(model), following 
Eq. (29). The posterior probability that the parameters lie in 
a defined region © is then 

P(0e®)= f P{6\D)&6. (30) 
J© 

The expected value of any function of parameters, g(0), is 

(g) = J g(0)P(O\D)d0, (31) 

and the variance about that expected value is 

Var [g(6)] ee J (g(6) - (g)) 2 P(0\D)&0. (32) 

In a similar way, confidence intervals, parameter covariances, 
and confidence intervals can all be evaluated easily in the usual 
manner given the posterior probability distribution found with 
the MCMC algorithm, providing us with a complete statistical 
picture of the parameter space that we explore. 

5. Conclusion 

We have investigated plausible priors for Bayesian analysis of 
caustic-crossing microlensing light curves, based on an alter- 
native parameterisation introduced by Cassan (2008). We have 
developed a mathematical formulation that allows us to com- 
pute analytically Bayesian priors for these parameters, given 
the knowledge we have about the physical quantities on which 
they depend. A number of relevant priors that may be used in 
a Bayesian, Markov chain Monte Carlo implementation of the 
given equations have been explored. 

8 More precisely, when — > 00 , Wood & Mao (2005) distribution 
behaves like ~ If out _ £inl 3 > and since / ~ 1/ If out - finl 2 > the net 
result is that near cusps, / ~ |f out - £ n | — > 0. 



In the context of the rapid development of a new generation 
of networks of classical and robotic telescopes (e.g., Tsapras 
et al. 2009), as well as space-based observations such as with 
the ESA project satellite Euclid (Beaulieu et al. 2010), a cur- 
rent challenge facing the microlens planet search community is 
to fully automate the fitting of binary lens light curves in real 
time, after having detected an anomaly (e.g., Home et al. 2009). 
This would enable anomalies that are detected in the observed 
light curves to be characterised as quickly as possible and for 
us to ascertain whether the anomalous behaviour is caused by 
a planet-mass companion of the lens star. Identifying parame- 
ters that could be estimated automatically by analysing the light 
curve (e.g., a magnification jump due to a caustic crossing) is 
already a step forward in accelerating the fitting codes by ex- 
ploring a far more tighter parameter space. This was the moti- 
vation of Cassan (2008) in defining a new set of parameters. In 
this work, we have added the possibility of including Bayesian 
priors in the analysis, which would avoid the need to explore 
combinations of parameters that are unlikely to happen. 
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